A Mid-level Melody-based Representation for Calculating Audio Similarity
نویسنده
چکیده
We propose a mid-level melody-based representation that incorporates melodic, rhythmic and structural aspects of a music signal and is useful for calculating audio similarity measures. Most current approaches to music similarity use either low-level signal features, such as MFCCs that mostly capture timbral characteristics of music and contain little semantic information, or require symbolic representations, which are difficult to obtain from audio signals. The proposed mid-level representation is our attempt to bridge the gap between audio and symbolic domains by providing an integrated melodic, rhythmic and structural representation of music signals. The representation is based on a set of melodic fragments extracted from prominent melodic lines, it is beat-synchronous, which makes it independent of tempo variations and contains information on repetitions of short melodic phrases within the analyzed piece. We show how it can be calculated automatically from polyphonic audio signals and demonstrate its use for discovering melodic similarities between songs. We present results obtained by using the representation for finding different interpretations of songs in a music collection.
منابع مشابه
Mid-Level Music Melody Representation of Polyphonic Audio for Query-by-Humming System
Recently a great attention is paid to content-based multimedia retrieval that enables users to find and locate audio-visual materials according to the intrinsic characteristics of the target. Query-by-humming (QBH) is also an application that makes retrieval based on major characteristics of music, that is, "melody". There have been some researches on QBH system, most of which are to retrieve m...
متن کاملPerforming Query-by-Melody on Audio Collections
Mid-level representations are increasingly used to bridge the gap between high-level (semantic) and low-level audio representations. A mid-level representation that integrates melodic and rhythmic aspects of a music signal is introduced. The representation is formed by first performing multi-pitch detection on consecutive audio frames and then searching for dominant melodic lines within the det...
متن کاملAlgorithms for melody search and transcription
This thesis studies two problems in music information retrieval: search for a given melody in an audio database, and automatic melody transcription. In both of the problems, the representation of the melody is symbolic, i.e., the melody consists of onset times and pitches of musical notes. In the first part of the thesis we present new algorithms for symbolic melody search. First, we present al...
متن کاملA Chroma-based Salience Function for Melody and Bass Line Estimation from Music Audio Signals
In this paper we present a salience function for melody and bass line estimation based on chroma features. The salience function is constructed by adapting the Harmonic Pitch Class Profile (HPCP) and used to extract a mid-level representation of melodies and bass lines which uses pitch classes rather than absolute frequencies. We show that our salience function has comparable performance to alt...
متن کاملAudio Melody Extraction Based on Timbral Similarity
The extended abstract presents our approach to extraction of melody from audio recordings, based on timbral similarity of melodic fragments. The algorithm was submitted to MIREX 2005 competition and scored 5 among 9 submissions, with an average score of 59.18% correctly transcribed voiced and unvoiced portions.
متن کامل